LRR Conservation Mapping to Predict Functional Sites within Protein Leucine-Rich Repeat Domains
Abstract
Computational prediction of protein functional sites can be a critical first step for analysis of large or complex proteins. Contemporary methods often require several homologous sequences and/or a known protein structure, but these resources are not available for many proteins. Leucine-rich repeats (LRRs) are ligand interaction domains found in numerous proteins across all taxonomic kingdoms, including immune system receptors in plants and animals. We devised Repeat Conservation Mapping (RCM), a computational method that predicts functional sites of LRR domains. RCM utilizes two or more homologous sequences and a generic representation of the LRR structure to identify conserved or diversified patches of amino acids on the predicted surface of the LRR. RCM was validated using solved LRR+ligand structures from multiple taxa, identifying ligand interaction sites. RCM was then used for de novo dissection of two plant microbe-associated molecular pattern (MAMP) receptors, EF-TU RECEPTOR (EFR) and FLAGELLIN-SENSING 2 (FLS2). In vivo testing of Arabidopsis thaliana EFR and FLS2 receptors mutagenized at sites identified by RCM demonstrated previously unknown functional sites. The RCM predictions for EFR, FLS2 and a third plant LRR protein, PGIP, compared favorably to predictions from ODA (optimal docking area), Consurf, and PAML (positive selection) analyses, but RCM also made valid functional site predictions not available from these other bioinformatic approaches. RCM analyses can be conducted with any LRR-containing proteins at www.plantpath.wisc.edu/RCM, and the approach should be modifiable for use with other types of repeat protein domains.
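The core idea described above, scoring conservation across homologs and then looking for patches that span neighboring repeats, can be illustrated with a short sketch. This is not the published RCM implementation. The function names, the fixed `repeat_len`, the identity-based conservation score, and the `half` window size are all assumptions made for illustration, and the sketch does not restrict scoring to predicted surface positions as RCM does.

```python
from collections import Counter


def column_conservation(column):
    """Fraction of non-gap residues that match the most common residue."""
    residues = [r for r in column if r != "-"]
    if not residues:
        return 0.0
    return Counter(residues).most_common(1)[0][1] / len(residues)


def conservation_grid(aligned_seqs, repeat_len):
    """Score every alignment column, then fold the scores into one row per repeat.

    Assumes (hypothetically) that the LRR domain is already aligned so that
    every repeat occupies exactly `repeat_len` columns.
    """
    length = len(aligned_seqs[0])
    if any(len(s) != length for s in aligned_seqs) or length % repeat_len:
        raise ValueError("need equal-length sequences spanning whole repeats")
    scores = [column_conservation([s[i] for s in aligned_seqs])
              for i in range(length)]
    return [scores[i:i + repeat_len] for i in range(0, length, repeat_len)]


def patch_scores(grid, half=1):
    """Average each cell with its neighbours in adjacent repeats and positions.

    Neighbouring rows stand in for repeats that sit side by side on the
    folded LRR surface; windows are clipped at the grid edges.
    """
    n_rep, n_pos = len(grid), len(grid[0])
    out = []
    for r in range(n_rep):
        row = []
        for p in range(n_pos):
            cells = [grid[rr][pp]
                     for rr in range(max(0, r - half), min(n_rep, r + half + 1))
                     for pp in range(max(0, p - half), min(n_pos, p + half + 1))]
            row.append(sum(cells) / len(cells))
        out.append(row)
    return out


# Toy input: three homologs, two repeats of four residues each (made-up data).
toy_seqs = ["ACDEACDE", "ACDFACDE", "ACDGACDE"]
toy_grid = conservation_grid(toy_seqs, repeat_len=4)
toy_patches = patch_scores(toy_grid, half=1)
```

High patch scores mark candidate conserved regions and low scores mark candidate diversified regions. A real analysis would need a proper repeat alignment and a substitution-aware conservation measure.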
Similar resources
Human leucine-rich repeat proteins: a genome-wide bioinformatic categorization and functional analysis in innate immunity.
In innate immune sensing, the detection of pathogen-associated molecular patterns by recognition receptors typically involves leucine-rich repeats (LRRs). We provide a categorization of 375 human LRR-containing proteins, almost half of which lack other identifiable functional domains. We clustered human LRR proteins by first assigning LRRs to LRR classes and then grouping the proteins based on t...
Identification and Mutational Analysis of Arabidopsis FLS2 Leucine-Rich Repeat Domain Residues That Contribute to Flagellin Perception
Mutational, phylogenetic, and structural modeling approaches were combined to develop a general method to study leucine-rich repeat (LRR) domains and were used to identify residues within the Arabidopsis thaliana FLAGELLIN-SENSING2 (FLS2) LRR that contribute to flagellin perception. FLS2 is a transmembrane receptor kinase that binds bacterial flagellin or a flagellin-based flg22 peptide through...
Scribble protein domain mapping reveals a multistep localization mechanism and domains necessary for establishing cortical polarity.
The Drosophila tumor suppressor protein Scribble is required for epithelial polarity, neuroblast polarity, neuroblast spindle asymmetry and limiting cell proliferation. It is a member of the newly described LAP protein family, containing 16 leucine rich repeats (LRRs), four PDZ domains and an extensive carboxyl-terminal (CT) domain. LRR and PDZ domains mediate protein-protein interactions, but ...
Biophysical Analysis of Anopheles gambiae Leucine-Rich Repeat Proteins APL1A1, APL1B and APL1C and Their Interaction with LRIM1
Natural infection of Anopheles gambiae by malaria-causing Plasmodium parasites is significantly influenced by the APL1 genetic locus. The locus contains three closely related leucine-rich repeat (LRR) genes, APL1A, APL1B and APL1C. Multiple studies have reported the participation of APL1A–C in the immune response of A. gambiae to invasion by both rodent and human Plasmodium isolates. APL1C form...